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(B) FILING DATE: 03-APR-1990 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME : Kohli, Vineet 

(B) REGISTRATION NUMBER: 37,003 

(C) REFERENCE/DOCKET NUMBER: SD9951IA 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732-594-3889 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 166.. 1755 

(D) OTHER INFORMATION: /product= "ALPHA- 2 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CAATGACCTG 


TTTTCTTCTG 


TAACCACAGG 


TTCGGTGGTG 


AGAGGAASCY 


TCGCAGAATC 


60 


CAGCAGAATC 


CTCACAGAAT 


CCAGCAGCAG 


CTCTGCTGGG 


GACATGGTCC 


ATGGTGCAAC 


120 


CCACAGCAAA 


GCCCTGACCT 


GACCTCCTGA 


TGCTCAGGAG 


AAGCCATGGG 


CCCCTCCTGT 


180 


CCTGTGTTCC 


TGTCCTTCAC 


AAAGCTCAGC 


CTGTGGTGGC 


TCCTTCTGAC 


CCCAGCAGGT 


240 


GGAGAGGAAG 


CTAAGCGCCC 


ACCTCCCAGG 


GCTCCTGGAG 


ACCCACTCTC 


CTCTCCCAGT 


300 


CCCACGGCAT 


TGCCGCAGGG 


AGGCTCGCAT 


ACCGAGACTG 


AGGACCGGCT 


CTTCAAACAC 


360 


CTCTTCCGGG 


GCTACAACCG 


CTGGGCGCGC 


CCGGTGCCCA 


ACACTTCAGA 


CGTGGTGATT 


420 


GTGCGCTTTG 


GACTGTCCAT 


CGCTCAGCTC 


ATCGATGTGG 


ATGAGAAGAA 


CCAAATGATG 


480 


ACCACCAACG 


TCTGGCTAAA 


ACAGGAGTGG 


AGC G AC T AC A 


AACTGCGCTG 


GAACCCCGCT 


540 


GATTTTGGCA 


ACATCACATC 


TCTCAGGGTC 


CCTTCTGAGA 


TGATC TGG AT 


CCCCGACATT 


600 


GTTCTCTACA 


ACAATGCAGA 


TGGGGAGTTT 


GCAGTGACCC 


ACATGACCAA 


GGCCCACCTC 


660 


TTCTCCACGG 


GCACTGTGCA 


CTGGGTGCCC 


CCGGCCATCT 


AC AAGAGC TC 


CTGCAGCATC 


720 


GACGTCACCT 


TCTTCCCCTT 


CGACCAGCAG 


AACTGCAAGA 


TGAAGTTTGG 


CTCCTGGACT 


780 


TATGACAAGG 


CCAAGATCGA 


CCTGGAGCAG 


ATGGAGCAGA 


CTGTGGACCT 


G AAGG AC T AC 


840 


TGGGAGAGCG 


GCGAGTGGGC 


CATCGTCAAT 


GCCACGGGCA 


CCTACAACAG 


CAAGAAGTAC 


900 


GACTGCTGCG 


CCGAGATCTA 


CCCCGACGTC 


ACCTACGCCT 


TCGTCATCCG 


GCGGCTGCCG 


960 
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CTCTTCTACA 


CCATCAACCT 


CATCATCCCC 


TGCCTGCTCA 


TCTCCTGCCT 


CACTGTGCTG 


1020 


GTCTTCTACC 


TGCCCTCCGA 


CTGCGGCGAG 


AAGATCACGC 


TGTGCATTTC 


GGTGCTGCTG 


1080 


TCACTCACCG 


TCTTCCTGCT 


GCTCATCACT 


GAGATCATCC 


CGTCCACCTC 


GCTGGTCATC 


1140 


CCGCTCATCG 


GCGAGTACCT 


GCTGTTCACC 


ATGATCTTCG 


TCACCCTGTC 


CATCGTCATC 


1200 


ACCGTCTTCG 


TGCTCAATGT 


GCACCACCGC 


TCCCCCAGCA 


CCCACACCAT 


GCCCCACTGG 


1260 


GTGCGGGGGG 


CCCTTCTGGG 


CTGTGTGCCC 


CGGTGGCTTC 


TGATGAACCG 


GCCCCCACCA 


1320 


CCCGTGGAGC 


TCTGCCACCC 


CCTACGCCTG 


AAGCTCAGCC 


CCTCTTATCA 


CTGGCTGGAG 


1380 


AGCAACGTGG 


ATGCCGAGGA 


GAGGGAGGTG 


GTGGTGGAGG 


AGGAGGACAG 


ATGGGCATGT 


1440 


GCAGGTCATG 


TGGCCCCCTC 


TGTGGGCACC 


CTCTGCAGCC 


ACGGCCACCT 


GCACTCTGGG 


1500 


GCCTCAGGTC 


CCAAGGCTGA 


GGCTCTGCTG 


CAGGAGGGTG 


AGCTGCTGCT 


ATCACCCCAC 


1560 


ATGCAGAAGG 


CACTGGAAGG 


TGTGCACTAC 


ATTGCCGACC 


ACCTGCGGTC 


TGAGGATGCT 


1620 


GACTCTTCGG 


TGAAGGAGGA 


CTGGAAGTAT 


GTTGCCATGG 


TCATCGACAG 


GATCTTCCTC 


1680 


TGGCTGTTTA 


TCATCGTCTG 


CTTCCTGGGG 


ACCATCGGCC 


TCTTTCTGCC 


TCCGTTCCTA 


1740 


GCTGGAATGA 


TCTGACTGCA 


CCTCCCTCGA 


GCTGGCTCCC 


AGGGCAAAGG 


GGAGGGTTCT 


1800 


TGGATGTGGA 


AGGGCTTTGA 


ACAATGTTTA 


GATTTGGAGA 


TGAGCCCAAA 


GTGCCAGGGA 


1860 


GAACAGCCAG 


GTGAGGTGGG 


AGGTTGGAGA 


GCCAGGTGAG 


GTCTCTCTAA 


GTCAGGCTGG 


1920 


GGTTGAAGTT 


TGGAGTCTGT 


CCGAGTTTGC 


AGGGTGC TGA 


GCTGTATGGT 


CCAGCAGGGG 


1980 


AGTAATAAGG 


GCTCTTCCGG 


AAGGGGAGGA 


AGCGGGAGGC 


AGGGCCTGCA 


CCTGATGTGG 


2040 


AGGTACAGGG 


CAGATCTTCC 


CTACCGGGGA 


GGGATGGATG 


GTTG GAT AC A 


GGTGGCTGGG 


2100 


CTATTCCATC 


CATC TGGAAG 


CACATTTGAG 


CCTCCAGGCT 


TCTCCTTGAC 


GTCATTCCTC 


2160 


TCCTTCCTTG 


CTCCAAAATG 


GCTCTGCACC 


AGCCGGCCCC 


CAGGAGGTCT 


GGCAGAGCTG 


2220 


AGAGCCATGG 


CCTGCAGGGG 


CTCCATATGT 


CCCTACGCGT 


GCAGCAGGCA 


AACAAGA 


2277 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 9 amino acids 

(B) TYPE: amino acid 

( D ) TO POLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Pro Ser Cys Pro Val Phe Leu Ser Phe Thr Lys Leu Ser Leu 
15 10 15 

Trp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu Ala Lys Arg Pro 
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20 25 30 

Pro Pro Arg Ala Pro Gly Asp Pro Leu Ser Ser Pro Ser Pro Thr Ala 
35 40 45 

Leu Pro Gin Gly Gly Ser His Thr Glu Thr Glu Asp Arg Leu Phe Lys 
50 55 60 

His Leu Phe Arg Gly Tyr Asn Arg Trp Ala Arg Pro Val Pro Asn Thr 
65 70 75 80 

Ser Asp Val Val lie Val Arg Phe Gly Leu Ser lie Ala Gin Leu lie 
85 90 95 

Asp Val Asp Glu Lys Asn Gin Met Met Thr Thr Asn Val Trp Leu Lys 
100 105 110 

Gin Glu Trp Ser Asp Tyr Lys Leu Arg Trp Asn Pro Ala Asp Phe Gly 
115 120 125 

Asn He Thr Ser Leu Arg Val Pro Ser Glu Met He Trp He Pro Asp 
130 135 140 

He Val Leu Tyr Asn Asn Ala Asp Gly Glu Phe Ala Val Thr His Met 
145 150 155 160 

Thr Lys Ala His Leu Phe Ser Thr Gly Thr Val His Trp Val Pro Pro 
165 170 175 

Ala He Tyr Lys Ser Ser Cys Ser He Asp Val Thr Phe Phe Pro Phe 
180 185 190 

Asp Gin Gin Asn Cys Lys Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys 
195 200 205 

Ala Lys He Asp Leu Glu Gin Met Glu Gin Thr Val Asp Leu Lys Asp 
210 215 220 

Tyr Trp Glu Ser Gly Glu Trp Ala He Val Asn Ala Thr Gly Thr Tyr 
225 230 235 240 

Asn Ser Lys Lys Tyr Asp Cys Cys Ala Glu He Tyr Pro Asp Val Thr 
245 250 255 

Tyr Ala Phe Val He Arg Arg Leu Pro Leu Phe Tyr Thr He Asn Leu 
260 265 270 

He He Pro Cys Leu Leu He Ser Cys Leu Thr Val Leu Val Phe Tyr 
275 280 285 

Leu Pro Ser Asp Cys Gly Glu Lys He Thr Leu Cys He Ser Val Leu 
290 295 300 

Leu Ser Leu Thr Val Phe Leu Leu Leu He Thr Glu He He Pro Ser 
305 310 315 320 

Thr Ser Leu Val He Pro Leu He Gly Glu Tyr Leu Leu Phe Thr Met 
325 330 335 

He Phe Val Thr Leu Ser He Val He Thr Val Phe Val Leu Asn Val 
340 345 350 
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His His Arg Ser Pro Ser Thr His Thr Met Pro His Trp Val Arg Gly 
355 360 365 

Ala Leu Leu Gly Cys Val Pro Arg Trp Leu Leu Met Asn Arg Pro Pro 
370 375 380 

Pro Pro Val Glu Leu Cys His Pro Leu Arg Leu Lys Leu Ser Pro Ser 
385 390 395 400 

Tyr His Trp Leu Glu Ser Asn Val Asp Ala Glu Glu Arg Glu Val Val 
405 410 415 

Val Glu Glu Glu Asp Arg Trp Ala Cys Ala Gly His Val Ala Pro Ser 
420 425 430 

Val Gly Thr Leu Cys Ser His Gly His Leu His Ser Gly Ala Ser Gly 
435 440 445 

Pro Lys Ala Glu Ala Leu Leu Gin Glu Gly Glu Leu Leu Leu Ser Pro 
450 455 460 

His Met Gin Lys Ala Leu Glu Gly Val His Tyr lie Ala Asp His Leu 
465 470 475 480 

Arg Ser Glu Asp Ala Asp Ser Ser Val Lys Glu Asp Trp Lys Tyr Val 
485 490 495 

Ala Met Val lie Asp Arg lie Phe Leu Trp Leu Phe lie lie Val Cys 
500 505 510 

Phe Leu Gly Thr lie Gly Leu Phe Leu Pro Pro Phe Leu Ala Gly Met 
515 520 525 

He 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 39.. 1553 

(D) OTHER INFORMATION: /product= "ALPHA- 3 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CCGACCGTCC GGGTCCGCGG CCAGCCCGGC CACCAGCCAT GGGCTCTGGC CCGCTCTCGC 60 

TGCCCCTGGC GCTGTCGCCG CCGCGGCTGC TGCTGCTGCT GCTGTCTCTG CTGCCAGTGG 12 0 

CCAGGGCCTC AGAGGCTGAG CACCGTCTAT TTGAGCGGCT GTTTGAAGAT TACAATGAGA 180 
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TCATCCGGCC 


TGTAGCCAAC 


GTGTCTGACC 


CAGTCATCAT 


CCATTTCGAG 


GTGTCCATGT 


240 


CTCAGCTGGT 


GAAGGTGGAT 


GAAGTAAACC 


AGATCATGGA 


GACCAACCTG 


TGGCTCAAGC 


300 


AAATCTGGAA 


TGACTACAAG 


CTGAAGTGGA 


ACCCCTCTGA 


CTATGGTGGG 


GCAGAGTTCA 


360 


TGCGTGTCCC 


TGCACAGAAG 


ATCTGGAAGC 


CAGACATTGT 


GCTGTATAAC 


AATGCTGTTG 


420 


GGGATTTCCA 


GGTGGACGAC 


AAGACCAAAG 


CCTTACTCAA 


GT AC AC TGGG 


G AGGTG AC TT 


480 


GGATACCTCC 


GGCCATCTTT 


AAGAGC TCC T 


GTAAAATCGA 


CGTGACCTAC 


TTCCCGTTTG 


540 


ATTACCAAAA 


CTGTACCATG 


AAGTTCGGTT 


CCTGGTCCTA 


CGATAAGGCG 


AAAATCGATC 


600 


TGGTCCTGAT 


CGGCTCTTCC 


ATGAACCTCA 


AGGACTATTG 


GGAGAGCGGC 


GAGTGGGCCA 


660 


TCATCAAAGC 


CCCAGGCTAC 


AAACACGACA 


TCAAGTACAG 


CTGCTGCGAG 


GAG AT C T AC C 


720 


CCGACATCAC 


ATACTCGCTG 


WWCATCCGGC 


GGCTGTCGTT 


GTTCTACACC 


ATCAWCCTCA 


780 


TCATCCGC'TG 


GCTGATCATC 


TCCTTCATCA 


CTGTGGTCGT 


CTCCTACCTG 


CCCTCCGACT 


o a r\ 


GCGGCGAGAA 


GGTGACCCTG 


TGYATTTCTG 


TCCTCCTCTC 


CCTGACGGTG 


TTTCTCCTGG 


900 


TGATCACTGA 


GACCATCCCT 


TCCACCTCGC 


TGGTCATCCC 


CCTGATTGGA 


GAGTACCTCC 


960 


TGWWCACCAT 


GATTTGTGTA 


AC C TTGTC C A 


TCGACATCAC 


CGTCTGCGTG 


CTCAACGTGC 


1020 


ACTACAGAAC 


CCCGACGACA 


CACACAATGC 


CCTCATGGGT 


GAAGACTGTA 


TTCTTGAMCC 


1080 


TGCTCCCCAG 


GGTCATGTWC 


ATGACCAGGC 


CAACAAGCAA 


CGAGGGCAAC 


GCTCAGAAGC 


1140 


CGAGGCCCCT 


CTACGGTGCC 


GAGCTCTCAA 


ATCTGAATTG 


CTTCAGCCGC 


GCAGAGTCCA 


1200 


AAGGCTGCAA 


GGAGGGCTAC 


CCCTGCCAGG 


AC GGG ATGTG 


TGGTTACTGC 


CACCACCGCA 


1260 


GGATAAAAAT 


CTCCAATTTC 


AGTGCTAACC 


TC AC GAGAAG 


CTCTAGTTCT 


GAATCTGTTG 


1320 


ATGCTGTGLT 


GTCCC 1LIL1 


(jtL i 1 I 1 LAL 


C* A A A A T^r* A A 






X J o u 


AGTATATTGC 


TGAAAATATG 


AAAGCACAAA 


ATGAAGCCAA 


AGAGATTCAA 


GATGATTGGA 


1440 


AGTATGTTGC 


CATGGTGATT 


GATCGTATTT 


TTCTGTGGGT 


TTTCACCCTG 


GTGTGCATTC 


1500 


TAGGGACAGC 


AGGATTGTTT 


CTGCAACCCC 


TGATGGCCAG 


GGAAGATGCA 


TAAGCACTAA 


1560 


GCTGTGTGCC 


TGCCTGGGAG 


ACTTCCTTGT 


GTCAGGGCAG 


GAGGAGGCTG 


CTTCCTAGTA 


1620 


AGAACGTACT 


TTCTGTTATC 


AAGCTACCAG 


CTTT 






1654 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 04 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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Met Gly Ser Gly 
1 

Leu Leu Leu Leu 
20 



Ala Glu His Arg 
35 

lie Arg Pro Val 
50 

Val Ser Met Ser 
65 

Glu Thr Asn Leu 



Trp Asn Pro Ser 
100 

Gin Lys lie Trp 
115 

Asp Phe Gin Val 
130 

Glu Val Thr Trp 
145 

Asp Val Thr Tyr 



Gly Ser Trp Ser 
180 



Ser Ser Met Asn 
195 

lie Lys Ala Pro 
210 

Glu lie Tyr Pro 
225 

Leu Phe Tyr Thr 



lie Thr Val Val 
260 

Thr Leu Cys lie 
275 

lie Thr Glu Thr 
290 

Glu Tyr Leu Leu 
305 

Thr Val Cys Val 



Pro Leu Ser Leu 
5 

Leu Leu Ser Leu 



Leu Phe Glu Arg 
40 

Ala Asn Val Ser 
55 

Gin Leu Val Lys 
70 

Trp Leu Lys Gin 
85 

Asp Tyr Gly Gly 



Lys Pro Asp lie 
120 



Asp Asp Lys Thr 
135 

lie Pro Pro Ala 
150 

Phe Pro Phe Asp 
165 

Tyr Asp Lys Ala 



Leu Lys Asp Tyr 
200 

Gly Tyr Lys His 
215 

Asp lie Thr Tyr 
230 

lie Xaa Leu lie 
245 

Val Ser Tyr Leu 



Ser Val Leu Leu 
280 

lie Pro Ser Thr 
295 

Xaa Thr Met lie 
310 

Leu Asn Val His 



Pro Leu Ala Leu 
10 

Leu Pro Val Ala 
25 

Leu Phe Glu Asp 



Asp Pro Val lie 
60 

Val Asp Glu Val 
75 

lie Trp Asn Asp 
90 

Ala Glu Phe Met 
1 05 

Val Leu Tyr Asn 



Lys Ala Leu Leu 
140 



lie Phe Lys Ser 
155 

Tyr Gin Asn Cys 
170 



Lys lie Asp Leu 
185 

Trp Glu Ser Gly 



Asp lie Lys Tyr 
220 

Ser Leu Xaa lie 
235 

lie Arg Trp Leu 
250 



Pro Ser Asp Cys 
265 

Ser Leu Thr Val 



Ser Leu Val lie 
300 

Cys Val Thr Leu 
315 

Tyr Arg Thr Pro 



Ser Pro Pro Arg 
15 

Arg Ala Ser Glu 
30 

Tyr Asn Glu lie 
45 

lie His Phe Glu 



Asn Gin lie Met 
80 



Tyr Lys Leu Lys 
95 

Arg Val Pro Ala 
110 

Asn Ala Val Gly 
125 

Lys Tyr Thr Gly 



Ser Cys Lys lie 
160 

Thr Met Lys Phe 
175 

Val Leu lie Gly 
190 

Glu Trp Ala lie 
205 

Ser Cys Cys Glu 



Arg Arg Leu Ser 
240 

lie lie Ser Phe 
255 

Gly Glu Lys Val 
270 

Phe Leu Leu Val 
285 

Pro Leu lie Gly 



Ser lie Asp lie 
320 

Thr Thr His Thr 
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325 330 335 

Met Pro Ser Trp Val Lys Thr Val Phe Leu Xaa Leu Leu Pro Arg Val 
340 345 350 

Met Xaa Met Thr Arg Pro Thr Ser Asn Glu Gly Asn Ala Gin Lys Pro 
355 360 365 

Arg Pro Leu Tyr Gly Ala Glu Leu Ser Asn Leu Asn Cys Phe Ser Arg 
370 375 380 

Ala Glu Ser Lys Gly Cys Lys Glu Gly Tyr Pro Cys Gin Asp Gly Met 
385 390 395 400 

Cys Gly Tyr Cys His His Arg Arg lie Lys lie Ser Asn Phe Ser Ala 
405 410 415 

Asn Leu Thr Arg Ser Ser Ser Ser Glu Ser Val Asp Ala Val Leu Ser 
420 425 430 

Leu Ser AX a Leu Ser Pro Glu lie Lys 

435 440 445 

Tyr lie Ala Glu Asn Met Lys Ala Gin Asn Glu Ala Lys Glu lie Gin 
450 455 460 

Asp Asp Trp Lys Tyr Val Ala Met Val lie Asp Arg lie Phe Leu Trp 
465 470 475 480 

Val Phe Thr Leu Val Cys lie Leu Gly Thr Ala Gly Leu Phe Leu Gin 
485 490 495 

Pro Leu Met Ala Arg Glu Asp Ala 
500 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 173.. 2056 

(D) OTHER INFORMATION: /product= "ALPHA- 4 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCGCTCGCTG CGGCGCCGCC GCCGCNCCGC GCGCCACAGG AG AAGGC G AN CCGGGCCCGG 60 

CGGCCGAAGC GGCCCGCGAG GCGCGGGAGG CATGAAGTTG GGCGCGCACG GGCCTCGAAG 12 0 

CGGCGGGGAG CCGGGAGCCG CCCGCATCTA GAGCCCGCGA GGTGCGTGCG CCATGGAGCT 180 

AGGGGGCCCC GGAGCGCCGC GGCTGCTGCC GCCGCTGCTG CTGCTTCTGG GGACCGGCCT 240 
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CCTGCGCGCC 


AGCAGCCATG 


TGGAGACCCG 


GGCCCACGCC 


GAGGAGCGGC 


TCCTGAAGAA 


300 


ACTCTTCTCC 


GGTTACAACA 


AGTGGTCCCG 


ACCCGTGGCC 


AACATCTCGG 


ACGTGGTCCT 


360 


CGTCCGCTTC 


GGCCTGTCCA 


TCGCTCAGCT 


CATTGACGTG 


GATGAGAAGA 


ACCAGATGAT 


420 


GACCACGAAC 


GTC TGGGTG A 


AGCAGGAGTG 


GCACGACTAC 


AAGCTGCGCT 


GGGACCCAGC 


480 


TGACTATGAG 


AATGTCACCT 


CCATCCGCAT 


CCCCTCCGAG 


CTCATCTGGC 


GGCCGGACAT 


540 


CGCCCTCTAC 


AACAATGCTG 


ACGGGGACTT 


CGCGGCCACC 


CACCTGACCA 


AGGCCCACCT 


600 


GTTCCATGAC 


GGGCGGGTGC 


AGCGGACTCC 


CCCGGCCATT 


T AC AAGAGC T 


CCTGCAGCAT 


660 


CGACGTCACC 


TTCTTCCCCT 


TCGACCAGCA 


GAACTGCACC 


ATG AAATTC G 


GCTCCTGGAC 


720 


CTACGACAAG 


GCCAAGATCG 


ACCTGGTGAA 


CATGCACAGC 


CGCGTGGACC 


AGCTGGACTT 


780 


CTGGGAGAGT 


GGCGAGTGGC 


TCATCTCGGA 


CGCCGTGGGC 


ACCTACAACA 


CCAGGAAGTA 


840 


CGAGTGCTGC 


GCCGAGATCT 


ACCCGGACAT 


CACCTATGCC 


TACGCCATCC 


GGCGGCTGCC 


900 


GCTCTTCTAC 


ACCATCAACC 


TCATCATCCC 


CTGGCTGCTC 


ATCTCCTGCC 


TCACCGCGCT 


960 


GGTCTTCTAC 


CTGCCCTCCG 


AGTGTGGCGA 


GAAGATCACG 


CTGTGCATCT 


CCGTGCTGCT 


1020 


GTCGCTCACC 


GTCTTCCTGC 


TGCTCATCAC 


CGAGATCATC 


CCGTCCACCT 


CACTGGTCAT 


1080 


CCCACTCATC 


GGCGAGTACC 


TGCTGTTCAC 


CATGATCTTC 


GTCACCCTGT 


CCATCGCCAT 


1140 


CACGGTCTTC 


GTGCTCAACG 


TGCACCACCG 


CTCGCCACGC 


ACGCACACCA 


TGCCCACCTG 


1200 


GGTACGCAGG 


GTCTTCCTGG 


ACATCGTGCC 


ACGCCTGCTC 


CTCATGAAGC 


GGCCGTCCGT 


1260 


GGTCAAGGAC 


AATTGCCGGC 


GGCTCATCGA 


GTCCATGCAT 


AAGATGGCCA 


GTGCCCCGCG 


1320 


CTTCTGGCCC 


GAGCCAGAAG 


GGGAGCCCCC 


TGCCACGAGC 


GGCACCCAGA 


GCCTGCACCC 


1380 


TCCCTCACCG 


TCCTTCTGCG 


TCCCCCTGGA 


TGTGCCGGCT 


GAGCCTGGGC 


CTTCCTGCAA 


1440 


GTCACCCTCC 


GACCAGCTCC 


CTCCTCAGCA 


GCCCCTGGAA 


GCTGAGAAAG 


CCAGCCCCCA 


1500 


CCCCTCGCCT 


GGACCCTGCC 


GCCCGCCCCA 


CGGCACCCAG 


GCACCAGGGC 


TGGCCAAAGC 


1560 


CAGGTCCCTC 


AGCGTCCAGC 


ACATGTCCAG 


CCCTGGCGAA 


GC GGTGG AAG 


GCGGCGTCCG 


1620 


GTGCCGGTCT 


CGGAGCATCC 


AGTACTGTGT 


TCCCCGAGAC 


GATGCCGCCC 


CCGAGGCAGA 


1680 


TGGCCAGGCT 


GCCGGCGCCC 


TGGCCTCTCG 


CAACAGCCAC 


TCGGCTGAGC 


TCCCACCCCC 


1740 


AGACCAGCCC 


TCTCCGTGCA 


AATGCACATG 


CAAGAAGGAG 


CCCTCTTCGG 


TGTCCCCGAG 


1800 


CGCCACGGTC 


AAGACCCGCA 


GCACCAAAGC 


GCCGCCGCCG 


CACCTGCCCC 


TGTCGCCGGC 


1860 


CCTGAGCCGG 


GCGGTGGAGG 


GCGTCCAGTA 


CATTGCAGAC 


CACCTGAAGG 


CCGAAGACAC 


1920 


AGACTTCTCG 


GTGAAGGAGG 


AC TGGAAGTA 


CGTGGCCATG 


GTCATCGACC 


GCATCTTCCT 


1980 


CTGGATGTTC 


ATCATCGTCT 


GCCTGCTGGG 


GACGGTGGGC 


CTCTTCCTGC 


CGCCCTGGCT 


2040 


GGCTGGCATG 


ATCTAGGAAG 


GGACCGGGAG 


CCTGCGTGGC 


CTGGGGCTGC 


CGYGCACGGG 


2100 
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GCCAGCATCC ATGCGGCCGG CCTGGGGCCG GGCTGGCTTC TCCCTGGACT CTGTGGGGCC 2160 

ACACGTTTGC CAAATTTTCC TTCCTGTTCT GTGTCTGCTG TAAGACGGCC TTGGACGGCG 222 0 

ACACGGCCTC TGGGGAG AC C GAGTGTGGAG CTGCTTCCAG TTGGACTCTS GCCTCAGNAG 22 80 

GCAGCGGCTT GGAGCAGAGG TGGCGGTCGC CGCCTYCTAC CTGCAGGACT CGGGCTAAGT 2340 

CCAGCTCTCC CCCTGCGCAG CCC 23 63 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Leu Gly Gly Pro Gly Ala Pro Arg Leu Leu Pro Pro Leu Leu 
15 10 15 

Leu Leu Leu Gly Thr Gly Leu Leu Arg Ala Ser Ser His Val Glu Thr 
20 25 30 

Arg Ala His Ala Glu Glu Arg Leu Leu Lys Lys Leu Phe Ser Gly Tyr 
35 40 45 

Asn Lys Trp Ser Arg Pro Val Ala Asn lie Ser Asp Val Val Leu Val 
50 55 60 

Arg Phe Gly Leu Ser lie Ala Gin Leu lie Asp Val Asp Glu Lys Asn 
65 70 75 80 

Gin Met Met Thr Thr Asn Val Trp Val Lys Gin Glu Trp His Asp Tyr 
85 90 95 

Lys Leu Arg Trp Asp Pro Ala Asp Tyr Glu Asn Val Thr Ser lie Arg 
100 105 110 

lie Pro Ser Glu Leu lie Trp Arg Pro Asp lie Ala Leu Tyr Asn Asn 
115 120 125 

Ala Asp Gly Asp Phe Ala Ala Thr His Leu Thr Lys Ala His Leu Phe 
130 135 140 

His Asp Gly Arg Val Gin Arg Thr Pro Pro Ala lie Tyr Lys Ser Ser 
145 150 155 160 

Cys Ser lie Asp Val Thr Phe Phe Pro Phe Asp Gin Gin Asn Cys Thr 
165 170 175 

Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Lys lie Asp Leu Val 
180 185 190 

Asn Met His Ser Arg Val Asp Gin Leu Asp Phe Trp Glu Ser Gly Glu 
195 200 205 
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Trp Leu lie Ser 
210 

Cys Cys Ala Glu 
225 



Arg Leu Pro Leu 



lie Ser Cys Leu 
260 



Glu Lys lie Thr 
275 

Leu Leu Leu lie 
290 

Leu lie Gly Glu 
3 05 

lie Ala lie Thr 



Thr His Thr Met 
340 



Pro Arg Leu Leu 
355 

Arg Arg Leu lie 
370 

Trp Pro Glu Pro 
385 



Leu His Pro Pro 



Glu Pro Gly Pro 
420 



Gin Pro Leu Glu 
435 

Cys Arg Pro Pro 
450 

Ser Leu Ser Val 
465 

Gly Val Arg Cys 



Asp Ala Ala Pro 
500 

Arg Asn Ser His 
515 



Cys Lys Cys Thr 



Asp Ala Val Gly 
215 



lie Tyr Pro Asp 
230 

Phe Tyr Thr lie 
245 

Thr Ala Leu Val 



Leu Cys lie Ser 
280 



Thr Glu lie lie 
295 

Tyr Leu Leu Phe 
310 

Val Phe Val Leu 
325 

Pro Thr Trp Val 



Leu Met Lys Arg 
360 

Glu Ser Met His 
375 

Glu Gly Glu Pro 
390 

Ser Pro Ser Phe 
405 

Ser Cys Lys Ser 



Ala Glu Lys Ala 
440 



His Gly Thr Gin 
455 

Gin His Met Ser 
470 

Arg Ser Arg Ser 
485 

Glu Ala Asp Gly 



Ser Ala Glu Leu 
520 

Cys Lys Lys Glu 



Thr Tyr Asn Thr 
220 



lie Thr Tyr Ala 
235 

Asn Leu lie lie 
250 

Phe Tyr Leu Pro 
265 

Val Leu Leu Ser 



Pro Ser Thr Ser 
300 

Thr Met lie Phe 
315 

Asn Val His His 
330 

Arg Arg Val Phe 
345 

Pro Ser Val Val 



Lys Met Ala Ser 
380 

Pro Ala Thr Ser 
395 

Cys Val Pro Leu 
410 

Pro Ser Asp Gin 
425 

Ser Pro His Pro 



Ala Pro Gly Leu 
460 



Ser Pro Gly Glu 
475 

lie Gin Tyr Cys 
490 

Gin Ala Ala Gly 
505 

Pro Pro Pro Asp 



Pro Ser Ser Val 



Arg Lys Tyr Glu 



Tyr Ala lie Arg 
240 



Pro Trp Leu Leu 
255 

Ser Glu Cys Gly 
270 

Leu Thr Val Phe 
285 

Leu Val lie Pro 



Val Thr Leu Ser 
32 G 



Arg Ser Pro Arg 
335 



Leu Asp lie Val 
350 



Lys Asp Asn Cys 
365 

Ala Pro Arg Phe 



Gly Thr Gin Ser 
400 



Asp Val Pro Ala 
415 

Leu Pro Pro Gin 
430 

Ser Pro Gly Pro 
445 

Ala Lys Ala Arg 



Ala Val Glu Gly 
480 



Val Pro Arg Asp 
495 

Ala Leu Ala Ser 
510 

Gin Pro Ser Pro 
525 

Ser Pro Ser Ala 
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530 535 540 

Thr Val Lys Thr Arg Ser Thr Lys Ala Pro Pro Pro His Leu Pro Leu 
545 550 555 560 

Ser Pro Ala Leu Ser Arg Ala Val Glu Gly Val Gin Tyr lie Ala Asp 
565 570 575 

His Leu Lys Ala Glu Asp Thr Asp Phe Ser Val Lys Glu Asp Trp Lys 
580 585 590 

Tyr Val Ala Met Val He Asp Arg He Phe Leu Trp Met Phe He He 
595 600 605 

Val Cys Leu Leu Gly Thr Val Gly Leu Phe Leu Pro Pro Trp Leu Ala 
610 615 620 

Gly Met He 
625 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 155.. 1561 

(D) OTHER INFORMATION: /product= "ALPHA- 5 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CCCGGCGGGA GCTGTGGCGC GGAGCGGCCC CGCTGCTGCG TCTGCCCTCG TTTTGTCTCA 6 0 

CGACTCACAC TCAGTGCTGC ATTCCCCAAG AGTTCGCGTT CCCCGCGCGG C GGTC GAG AG 12 0 

GCGGCTGCCC GCGGTCCCGC GCGGGCGCGG GGCG ATG GCG GCG CGG GGG TCA 172 

Met Ala Ala Arg Gly Ser 
1 5 

GGG CCC CGC GCG CTC CGC CTG CTG CTC TTG GTC CAG CTG GTC GCG GGG 22 0 

Gly Pro Arg Ala Leu Arg Leu Leu Leu Leu Val Gin Leu Val Ala Gly 
10 15 20 

CGC TGC GGT CTA GCG GGC GCG GCG GGC GGC GCG CAG AGA GGA TTA TCT 2 68 

Arg Cys Gly Leu Ala Gly Ala Ala Gly Gly Ala Gin Arg Gly Leu Ser 
25 30 35 

GAA CCT TCT TCT ATT GCA AAA CAT GAA GAT AGT TTG CTT AAG GAT TTA 316 
Glu Pro Ser Ser He Ala Lys His Glu Asp Ser Leu Leu Lys Asp Leu 
40 45 50 

TTT CAA GAC TAC GAA AGA TGG GTT CGT CCT GTG GAA CAC CTG AAT GAC 3 64 

Phe Gin Asp Tyr Glu Arg Trp Val Arg Pro Val Glu His Leu Asn Asp 
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55 60 65 70 

AAA ATA AAA ATA AAA TTT GGA CTT GCA ATA TCT CAA TTG GTG GAT GTG 412 
Lys lie Lys lie Lys Phe Gly Leu Ala lie Ser Gin Leu Val Asp Val 
75 80 85 

GAT GAG AAA AAT CAG TTA ATG AC A AC A AAC GTC TGG TTG AAA CAG GAA 460 
Asp Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu 
90 95 100 

TGG ATA GAT GTA AAA TTA AGA TGG AAC CCT GAT GAC TAT GGT GGA ATA 508 
Trp lie Asp Val Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly lie 
105 110 115 

AAA GTT ATA CGT GTT CCT TCA GAC TCT GTC TGG ACA CCA GAC ATC GTT 55 6 

Lys Val lie Arg Val Pro Ser Asp Ser Val Trp Thr Pro Asp lie Val 
120 125 130 

TTG TTT GAT AAT GCA GAT GGA CGT TTT GAA GGG ACC AGT ACG AAA ACA 604 
Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu Gly Thr Ser Thr Lys Thr 
135 140 14-5 150 

GTC ATC AGG TAC AAT GGC ACT GTC ACC TGG ACT CCA CCG GCA AAC TAC 652 
Val lie Arg Tyr Asn Gly Thr Val Thr Trp Thr Pro Pro Ala Asn Tyr 
155 160 165 

AAA AGT TCC TGT ACC ATA GAT GTC ACG TTT TTC CCA TTT GAC CTT CAG 7 00 

Lys Ser Ser Cys Thr lie Asp Val Thr Phe Phe Pro Phe Asp Leu Gin 
170 175 180 

AAC TGT TCC ATG AAA TTT GGT TCT TGG ACT TAT GAT GGA TCA CAG GTT 74 8 

Asn Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Ser Gin Val 
185 190 195 

GAT ATA ATT CTA GAG GAC CAA GAT GTA GAC AAG AGA GAT TTT TTT GAT 79 6 

Asp lie lie Leu Glu Asp Gin Asp Val Asp Lys Arg Asp Phe Phe Asp 
200 205 210 

AAT GGA GAA TGG GAG ATT GTG AGT GCA ACA GGG AGC AAA GGA AAC AGA 844 
Asn Gly Glu Trp Glu lie Val Ser Ala Thr Gly Ser Lys Gly Asn Arg 
215 220 225 230 

ACC GAC AGC TGT TGC TGG TAT CCG TAT GTC ACT TAC TCA TTT GTA ATC 892 
Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val Thr Tyr Ser Phe Val lie 
235 240 245 

AAG CGC CTG CCT CTC TTT TAT ACC TTG TTC CTT ATA ATA CCC TGT ATT 940 
Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys He 
250 255 260 

GGG CTC TCA TTT TTA ACT GTA CTT GTC TTC TAT CTT CCT TCA AAT GAA 988 
Gly Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asn Glu 
265 270 275 

GGT GAA AAG ATT TGT CTC TGC ACT TCA GTA CTT GTG TCT TTG ACT GTC 103 6 

Gly Glu Lys He Cys Leu Cys Thr Ser Val Leu Val Ser Leu Thr Val 
280 285 290 

TTC CTT CTG GTT ATT GAA GAG ATC ATA CCA TCA TCT TCA AAA GTC ATA 1084 
Phe Leu Leu Val He Glu Glu He He Pro Ser Ser Ser Lys Val He 
295 300 305 310 
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CCT CTA ATT GGA GAG TAT CTG GTA TTT ACC ATG ATT TTT GTG AC A CTG 1132 
Pro Leu lie Gly Glu Tyr Leu Val Phe Thr Met lie Phe Val Thr Leu 
315 320 325 

TCA ATT ATG GTA ACC GTC TTC GCT ATC AAC ATT CAT CAT CGT TCT TCC 1180 
Ser lie Met Val Thr Val Phe Ala lie Asn lie His His Arg Ser Ser 
330 335 340 

TCA ACA CAT AAT GCC ATG GCG CCT TTG GTC CGC AAG ATA TTT CTT CAC 122 8 

Ser Thr His Asn Ala Met Ala Pro Leu Val Arg Lys lie Phe Leu His 
345 350 355 

ACG CTT CCC AAA CTG CTT TGC ATG AGA AGT CAT GTA GAC AGG TAC TTC 127 6 

Thr Leu Pro Lys Leu Leu Cys Met Arg Ser His Val Asp Arg Tyr Phe 
360 365 370 

ACT CAG AAA GAG GAA ACT GAG AGT GGT AGT GGA CCA AAA TCT TCT AGA 1324 
Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser Gly Pro Lys Ser Ser Arg 
375 380 385 390 

AAC ACA TTG GAA GGT GCG CTC AAT TCT ATT CGC TAC ATT ACA AGA GAC 1372 
Asn Thr Leu Glu Ala Ala Leu Asn Ser lie Arg Tyr lie Thr Arg His 
395 400 405 

ATC ATG AAG GAA AAT GAT GTC CGT GAG GTT GTT GAA GAT TGG AAA TTC 142 0 

lie Met Lys Glu Asn Asp Val Arg Glu Val Val Glu Asp Trp Lys Phe 
410 415 420 

ATA GCC CAG GTT CTT GAT CGG ATG TTT CTG TGG ACT TTT CTT TTC GTT 1468 
lie Ala Gin Val Leu Asp Arg Met Phe Leu Trp Thr Phe Leu Phe Val 
425 430 435 

TCA ATT GTT GGA TCT CTT GGG CTT TTT GTT CCT GTT ATT TAT AAA TGG 1516 
Ser lie Val Gly Ser Leu Gly Leu Phe Val Pro Val lie Tyr Lys Trp 
440 445 450 

GCA AAT ATA TTA ATA CCA GTT CAT ATT GGA AAT GCA AAT AAG TGAAGCCTCC 1568 
Ala Asn lie Leu lie Pro Val His lie Gly Asn Ala Asn Lys 
455 460 465 

CAAGGGACTG AAGTATACAT TTAGTTAACA CACATATATC TGATGGCACC TATAAAATTA 1628 

TGAAAATGTA AGTTATGTGT TAAATTTAGT GCAAGCTTTA AC AGAC TAAG TTGCTAACCT 1688 

CAATTTATGT TAACAGATGA TCCATTTGAA CAGTTGGCTG T ATG AC TG AA GTAATAACTG 1748 

ATGAGATACA TTTG ATC TTG TAAAAATAGC AAAATATTAT C TG AAC TGG A CTAGTGAAAA 1808 

ATCTAGTATT TGTATCCTGG 182 8 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
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Met Ala Ala Arg Gly Ser Gly Pro Arg Ala Leu Arg Leu Leu Leu Leu 
15 10 15 

Val Gin Leu Val Ala Gly Arg Cys Gly Leu Ala Gly Ala Ala Gly Gly 
20 25 30 

Ala Gin Arg Gly Leu Ser Glu Pro Ser Ser lie Ala Lys His Glu Asp 
35 40 45 

Ser Leu Leu Lys Asp Leu Phe Gin Asp Tyr Glu Arg Trp Val Arg Pro 
50 55 60 

Val Glu His Leu Asn Asp Lys lie Lys lie Lys Phe Gly Leu Ala lie 
65 70 75 80 

Ser Gin Leu Val Asp Val Asp Glu Lys Asn Gin Leu Met Thr Thr Asn 
85 90 95 

Val Trp Leu Lys Gin Glu Trp lie Asp Val Lys Leu Arg Trp Asn Pro 
100 105 110 

Asp Asp Tyr Gly Gly lie Lys Val lie Arg Val Pro Ser Asp Ser Val 
115 120 125 

Trp Thr Pro Asp lie Val Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu 
130 135 140 

Gly Thr Ser Thr Lys Thr Val lie Arg Tyr Asn Gly Thr Val Thr Trp 
145 150 155 160 

Thr Pro Pro Ala Asn Tyr Lys Ser Ser Cys Thr lie Asp Val Thr Phe 
165 170 175 

Phe Pro Phe Asp Leu Gin Asn Cys Ser Met Lys Phe Gly Ser Trp Thr 
180 185 190 

Tyr Asp Gly Ser Gin Val Asp lie lie Leu Glu Asp Gin Asp Val Asp 
195 200 205 

Lys Arg Asp Phe Phe Asp Asn Gly Glu Trp Glu lie Val Ser Ala Thr 
210 215 220 

Gly Ser Lys Gly Asn Arg Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val 
225 230 235 240 

Thr Tyr Ser Phe Val lie Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe 
245 250 255 

Leu lie lie Pro Cys lie Gly Leu Ser Phe Leu Thr Val Leu Val Phe 
260 265 270 

Tyr Leu Pro Ser Asn Glu Gly Glu Lys lie Cys Leu Cys Thr Ser Val 
275 280 285 

Leu Val Ser Leu Thr Val Phe Leu Leu Val lie Glu Glu lie lie Pro 
290 295 300 

Ser Ser Ser Lys Val lie Pro Leu lie Gly Glu Tyr Leu Val Phe Thr 
305 310 315 320 

Met lie Phe Val Thr Leu Ser lie Met Val Thr Val Phe Ala lie Asn 
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325 330 335 

lie His His Arg Ser Ser Ser Thr His Asn Ala Met Ala Pro Leu Val 
340 345 350 

Arg Lys lie Phe Leu His Thr Leu Pro Lys Leu Leu Cys Met Arg Ser 
355 360 365 

His Val Asp Arg Tyr Phe Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser 
370 375 380 

Gly Pro Lys Ser Ser Arg Asn Thr Leu Glu Ala Ala Leu Asn Ser lie 
385 390 395 400 

Arg Tyr lie Thr Arg His lie Met Lys Glu Asn Asp Val Arg Glu Val 
405 410 415 

Val Glu Asp Trp Lys Phe lie Ala Gin Val Leu Asp Arg Met Phe Leu 
420 425 430 

Trp Thr Phe Leu Phe Val Ser Xle val Gly Ser Leu Gly Leu Phe Val 
435 440 445 

Pro Val lie Tyr Lys Trp Ala Asn lie Leu lie Pro Val His lie Gly 
450 455 460 

Asn Ala Asn Lys 
465 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 143.. 1627 

(D) OTHER INFORMATION: /product^ "ALPHA- 6 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CGGGTTTTGA TTTC TGAGAA GACACACACG GATTGCAGTG GGCTTCTGAT GATGTCAAGG 6 0 

TTGGATGCAT GTGGCTGACT GATAGCTCTT TGTTTTCCAC AATCCTTTGC CTAGGAAAAA 12 0 

GGAATCCAAG TGTGTTTTAA CC ATG CTG ACC AGC AAG GGG CAG GGA TTC CTT 172 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu 
15 10 

CAT GGG GGC TTG TGT CTC TGG CTG TGT GTG TTC ACA CCT TTC TTT AAA 22 0 
His Gly Gly Leu Cys Leu Trp Leu Cys Val Phe Thr Pro Phe Phe Lys 
15 20 25 

GGC TGT GTG GGC TGT GCA ACT GAG GAG AGG CTC TTC CAC AAA CTG TTT 2 68 
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Gly Cys Val Gly Cys Ala Thr Glu Glu Arg Leu Phe His Lys Leu Phe 
30 35 40 

TCT CAT TAC AAC CAG TTC ATC AGG CCT GTG GAA AAC GTT TCC GAC CCT 316 
Ser His Tyr Asn Gin Phe lie Arg Pro Val Glu Asn Val Ser Asp Pro 
45 50 55 

GTC ACG GTA CAC TTT GAA GTG GCC ATC ACC CAG CTG GCC AAC GTG GAT 3 64 

Val Thr Val His Phe Glu Val Ala lie Thr Gin Leu Ala Asn Val Asp 
60 65 70 

GAA GTA AAC CAG ATC ATG GAA ACC AAT TTG TGG CTG CGT CAC ATC TGG 412 
Glu Val Asn Gin lie Met Glu Thr Asn Leu Trp Leu Arg His lie Trp 
75 80 85 90 

AAT GAT TAT AAA TTG CGC TGG GAT CCA ATG GAA TAT GAT GGC ATT GAG 460 
Asn Asp Tyr Lys Leu Arg Trp Asp Pro Met Glu Tyr Asp Gly lie Glu 
95 100 105 

ACT CTT CGC GTT CCT GCA GAT AAG ATT TGG AAG CCC GAC ATT GTT CTC 5 08 

Thr Leu Arg Val Pro Ala Asp Lys lie Trp Lys Pro Asp lie Val Leu 
110 115 120 

TAT AAC AAT GCT GTT GGT GAC TTC CAA GTA GAA GGC AAA AC A AAA GCT 556 
Tyr Asn Asn Ala Val Gly Asp Phe Gin Val Glu Gly Lys Thr Lys Ala 
125 130 135 

CTT CTT AAA TAC AAT GGC ATG ATA ACC TGG ACT CCA CCA GCT ATT TTT 604 
Leu Leu Lys Tyr Asn Gly Met lie Thr Trp Thr Pro Pro Ala lie Phe 
140 145 150 

AAG AGT TCC TGC CCT ATG GAT ATC ACC TTT TTC CCT TTT GAT CAT CAA 652 
Lys Ser Ser Cys Pro Met Asp lie Thr Phe Phe Pro Phe Asp His Gin 
155 160 165 170 

AAC TGT TCC CTA AAA TTT GGT TCC TGG ACG TAT GAC AAA GCT GAA ATT 7 00 

Asn Cys Ser Leu Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Glu lie 
175 180 185 

GAT CTT CTA ATC ATT GGA TCA AAA GTG GAT ATG AAT GAT TTT TGG GAA 7 48 

Asp Leu Leu lie lie Gly Ser Lys Val Asp Met Asn Asp Phe Trp Glu 
190 195 200 

AAC AGT GAA TGG GAA ATC ATT GAT GCC TCT GGC TAC AAA CAT GAC ATC 7 96 

Asn Ser Glu Trp Glu lie lie Asp Ala Ser Gly Tyr Lys His Asp lie 
205 210 215 

AAA TAC AAC TGT TGT GAA GAG ATA TAC ACA GAT ATA ACC TAT TCT TTC 844 
Lys Tyr Asn Cys Cys Glu Glu lie Tyr Thr Asp lie Thr Tyr Ser Phe 
220 225 230 

TAC ATT AGA AGA TTG CCG ATG TTT TAC ACG ATT AAT CTG ATC ATC CCT 892 
Tyr lie Arg Arg Leu Pro Met Phe Tyr Thr lie Asn Leu lie lie Pro 
235 240 245 250 

TGT CTC TTT ATT TCA TTT CTA ACC GTG TTG GTC TTT TAC CTT CCT TCG 940 
Cys Leu Phe lie Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser 
255 260 265 

GAC TGT GGT GAA AAA GTG ACG CTT TGT ATT TCA GTC CTG CTT TCT CTG 988 
Asp Cys Gly Glu Lys Val Thr Leu Cys lie Ser Val Leu Leu Ser Leu 
270 275 280 
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ACT GTG TTT TTG CTG GTC ATC AC A GAA ACC ATC CCA TCC AC A TCT CTG 103 6 

Thr Val Phe Leu Leu Val lie Thr Glu Thr lie Pro Ser Thr Ser Leu 
285 290 295 

GTG GTC CCA CTG GTG GGT GAG TAC CTG CTG TTC ACC ATG ATC TTT GTC 1084 
Val Val Pro Leu Val Gly Glu Tyr Leu Leu Phe Thr Met lie Phe Val 
300 305 310 

ACA CTG TCC ATC GTG GTG ACT GTG TTT GTG TTG AAC ATA CAC TAC CGC 1132 
Thr Leu Ser lie Val Val Thr Val Phe Val Leu Asn lie His Tyr Arg 
315 320 325 330 

ACC CCA ACC ACG CAC ACA ATG CCC AGG TGG GTG AAG ACA GTT TTC CTG 118 0 

Thr Pro Thr Thr His Thr Met Pro Arg Trp Val Lys Thr Val Phe Leu 
335 340 345 

AAG CTG CTG CCC CAG GTC CTG CTG ATG AGG TGG CCT CTG GAC AAG ACA 122 8 

Lys Leu Leu Pro Gin Val Leu Leu Met Arg Trp Pro Leu Asp Lys Thr 
350 355 360 

AGG GGC ACA GGC TCT GAT GCA GTG CCC AGA GGC CTT GCC AGG AGG CCT 127 6 

Arg Gly Thr Gly Ser Asp Ala Val Pro Arg Gly Leu Ala Arg Arg Pro 
365 370 375 

GCC AAA GGC AAG CTT GCA AGC CAT GGG GAA CCC AGA CAT CTT AAA GAA 132 4 

Ala Lys Gly Lys Leu Ala Ser His Gly Glu Pro Arg His Leu Lys Glu 
380 385 390 

TGC TTC CAT TGT CAC AAA TCA AAT GAG CTT GCC ACA AGC AAG AGA AGA 137 2 

Cys Phe His Cys His Lys Ser Asn Glu Leu Ala Thr Ser Lys Arg Arg 
395 400 405 410 

TTA AGT CAT CAG CCA TTA CAG TGG GTG GTG GAA AAT TCG GAG CAC TCG 142 0 

Leu Ser His Gin Pro Leu Gin Trp Val Val Glu Asn Ser Glu His Ser 
415 420 425 

CCT GAA GTT GAA GAT GTG ATT AAC AGT GTT CAG TTC ATA GCA GAA AAC 146 8 

Pro Glu Val Glu Asp Val lie Asn Ser Val Gin Phe lie Ala Glu Asn 
430 435 440 

ATG AAG AGC CAC AAT GAA ACC AAG GAG GTA GAA GAT GAC TGG AAA TAC 1516 
Met Lys Ser His Asn Glu Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr 
445 450 455 

GTG GCC ATG GTG GTG GAC AGA GTA TTT CTT TGG GTA TTT ATA ATT GTC 1564 
Val Ala Met Val Val Asp Arg Val Phe Leu Trp Val Phe lie lie Val 
460 465 470 

TGT GTA TTT GGA ACT GCA GGG CTA TTT CTA CAG CCA CTA CTT GGG AAC 1612 
Cys Val Phe Gly Thr Ala Gly Leu Phe Leu Gin Pro Leu Leu Gly Asn 
475 480 485 490 

ACA GGA AAA TCT TAAAATGTAT TTTC TTTTAT GTTC AG AAAT TTACAGACAC 1664 
Thr Gly Lys Ser 

495 

CATATTTGTT CTGCATTCCC TGCCACAAGG AAAGGAAAGC AAAGGC TTC C CACCCAAGTC 1724 
CCCCATCTGC TAAAACCCG 1743 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 494 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu His Gly Gly Leu Cys Leu 
15 10 15 

Trp Leu Cys Val Phe Thr Pro Phe Phe Lys Gly Cys Val Gly Cys Ala 
20 25 30 

Thr Glu Glu Arg Leu Phe His Lys Leu Phe Ser His Tyr Asn Gin Phe 
35 40 45 

lie Arg Pro Val Glu Asn Val Ser Asp Pro Val Thr Val His Phe Glu 
50 55 60 

Val Ala lie Thr Gin Leu Ala Asn Val Asp Glu Val Asn Gin lie Met 
65 70 75 80 

Glu Thr Asn Leu Trp Leu Arg His lie Trp Asn Asp Tyr Lys Leu Arg 
85 90 95 

Trp Asp Pro Met Glu Tyr Asp Gly lie Glu Thr Leu Arg Val Pro Ala 
100 105 110 

Asp Lys lie Trp Lys Pro Asp lie Val Leu Tyr Asn Asn Ala Val Gly 
115 120 125 

Asp Phe Gin Val Glu Gly Lys Thr Lys Ala Leu Leu Lys Tyr Asn Gly 
130 135 140 

Met lie Thr Trp Thr Pro Pro Ala lie Phe Lys Ser Ser Cys Pro Met 
145 150 155 160 

Asp lie Thr Phe Phe Pro Phe Asp His Gin Asn Cys Ser Leu Lys Phe 
165 170 175 

Gly Ser Trp Thr Tyr Asp Lys Ala Glu lie Asp Leu Leu lie lie Gly 
180 185 190 

Ser Lys Val Asp Met Asn Asp Phe Trp Glu Asn Ser Glu Trp Glu lie 
195 200 205 

lie Asp Ala Ser Gly Tyr Lys His Asp lie Lys Tyr Asn Cys Cys Glu 
210 215 220 

Glu lie Tyr Thr Asp lie Thr Tyr Ser Phe Tyr lie Arg Arg Leu Pro 
225 230 235 240 

Met Phe Tyr Thr He Asn Leu He He Pro Cys Leu Phe He Ser Phe 
245 250 255 

Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys Val 
260 265 270 
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Thr Leu Cys lie Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val 
275 280 285 

lie Thr Glu Thr lie Pro Ser Thr Ser Leu Val Val Pro Leu Val Gly 
290 295 300 

Glu Tyr Leu Leu Phe Thr Met lie Phe Val Thr Leu Ser lie Val Val 
305 310 315 320 

Thr Val Phe Val Leu Asn lie His Tyr Arg Thr Pro Thr Thr His Thr 
325 330 335 

Met Pro Arg Trp Val Lys Thr Val Phe Leu Lys Leu Leu Pro Gin Val 
340 345 350 

Leu Leu Met Arg Trp Pro Leu Asp Lys Thr Arg Gly Thr Gly Ser Asp 
355 360 365 

Ala Val Pro Arg Gly Leu Ala Arg Arg Pro Ala Lys Gly Lys Leu Ala 
370 375 380 

Ser His Gly Glu Pro Arg His Leu Lys Glu Cys Phe His Cys His Lys 
385 390 395 400 

Ser Asn Glu Leu Ala Thr Ser Lys Arg Arg Leu Ser His Gin Pro Leu 
405 410 415 

Gin Trp Val Val Glu Asn Ser Glu His Ser Pro Glu Val Glu Asp Val 
420 425 430 

lie Asn Ser Val Gin Phe lie Ala Glu Asn Met Lys Ser His Asn Glu 
435 440 445 

Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr Val Ala Met Val Val Asp 
450 455 460 

Arg Val Phe Leu Trp Val Phe lie lie Val Cys Val Phe Gly Thr Ala 
465 470 475 480 

Gly Leu Phe Leu Gin Pro Leu Leu Gly Asn Thr Gly Lys Ser 
485 490 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 1581 

(D) OTHER INFORMATION: /produCt= 11 ALPHA- 7 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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GGCCGCAGGC 


GCAGGCCCGG 


GCGACAGCCG 


CCGGGACTCA 


ACATGCGCTG 


CTCGCCGGGA 


CTGCACGTGT 


CCCTGCAAGG 


C GAGTTC C AG 


TACAATCCCT 


TGGAGAGGCC 


CGTGGCCAAT 


CTGAGCCTCC 


TGCAGATCAT 


GG AC GTGGAT 


TGGCTGCAAA 


TGTC TTGG AC 


AGATCACTAT 


GTGAAGACTG 


TTCGTTTCCC 


AGATGGCCAG 


AGTGCTGATG 


AGCGCTTTGA 


CGCCACATTC 


CATTGCCAGT 


ACCTGCCTCC 


AGGCATATTC 


TTTCCCTTTG 


ATGTGCAGCA 


CTGCAAACTG 


TCCTTGGATC 


TGCAGATGCA 


GGAGGCAGAT 


GACCTAGTGG 


GAATCCCCGG 


CAAGAGGAGT 


TACCCCGATG 


TCACCTTCAC 


AGTGACCATG 


CTGCTGATCC 


CCTGTGTGCT 


CATCTCCGCC 


GATTCCGGGG 


AGAAGATTTC 


CCTGGGGATA 


CTGCTCGTGG 


C TGAGATC AT 


GCCCGCAACA 


TTCGCCAGCA 


C C ATGATC AT 


CGTGGGCCTC 


TACCACCACC 


ACGACCCCGA 


CGGGGGCAAG 


AACTGGTGCG 


CGTGGTTCCT 


SCGAATGAAG 


TGCCAGCACA 


AGCAGCGGCG 


CTGCAGCCTG 


CCGCCCGCCA 


GCAACGGGAA 


CCTGCTGTAC 


TGTGTCCCGA 


CCCCCGACTC 


TGGGGTAGTG 


GATGAGCACC 


TCCTGCACGG 


CGGGCAACCC 


CTGGAGGAGG 


TCCGCTACAT 


TGCCAATCGC 


TGCAGCGAGT 


GGAAGTTCGC 


CGCCTGTGTG 


GTCTTCACCA 


TCATCTGCAC 


CATCGGCATC 


GTGTCCAAAG 


ACTTTGCGTA 


ACCACGCCTG 


GCAAGGCCTT 


TGGCTTGGCG 


AGATTTGGGG 


AACTCCAGTG 


TTCCCTTCTG 


GCTGTCAGTC 


Vj X X"lVj -L T\\JS\T\ X 


CTPAfJCAPTT 


THTTTPATAT 

X VJ XXX V_ li X xi X 


CATCCGTACC 


ATCGGTCAGC 


AGGGC C AC TG 



AGAC GTGGAG CGCGCCGGCT CGCTGCAGCT 60 

GGCGTCTGGC TGGCGCTGGC CGCGTCGCTC 12 0 

AGGAAGCTTT ACAAGGAGCT GGTCAAGAAC 180 

GACTCGCAAC CACTCACCGT CTACTTCTCC 240 

GAGAAGAACC AAGTTTTAAC CACCAACATT 3 00 

TTACAGTGGA ATGTGTCAGA ATATC C AGGG 3 60 

ATTTGGAAAC CAGACATTCT TCTCTATAAC 42 0 

CACACTAACG TGTTGGTGAA TTCTTCTGGG 48 0 

AAGAGTTCCT GCTACATCGA TGTACGCTGG 540 

AAGTTTGGGT CCTGGTCTTA CGGAGGCTGG 600 

ATCAGTGGCT ATATC CCCAA TGGAGAATGG 6 60 

GAAAGGTTCT ATGAGTGCTG CAAAGAGCCC 72 0 

CGCCGCAGGA CGCTCTACTA TGGCCTCAAC 7 80 

CTCGCCCTGC TGGTGTTCCT GCTTCCTGCA 840 

ACAGTCTTAC TCTCTCTTAC CGTCTTCATG 900 

TCCGATTCGG TACCATTGAT AGCCCAGTAC 960 

TCGGTGGTGG T G AC GGTG AT CGTGCTGCAG 102 0 

ATGCCCAAGT GGACCAGAGT CATCCTTCTG 1080 

AGGCCCGGGG AGGACAAGGT GCGCCCGGCC 1140 

GCCAGTGTGG AGATGAGCGC CGTGGCGCCG 12 00 

ATCGGCTTCC GCGGCCTGGA CGGCGTGCAC 12 60 

TGTGGCCGCA TGGCCTGCTC CCCCACGCAC 132 0 

CCCGAGGGGG ACCCGGACTT GGCCAAGATC 13 8 0 

TTCCGCTGCC AGGACGAAAG CGAGGCGGTC 1440 

GTGGACCGCC TGTGCCTCAT GGCCTTCTCG 15 00 

CTGATGTCGG CTCCCAACTT CGTGGAGGCC 15 6 0 

GTTC TGTAC A TGTGGAAAAC TCACAGATGG 162 0 

GTGCTAATCC AGGACAGCAT TACACGCCAC 1680 

GTGTTGCTTA CGGTTTCTTT GTT AC TTT AG 1740 

TC TC AG AT GG GCTGATAGAT ATCCTTGGCA 1800 

AGTAGTCATT TTGCCCATTA GCCCACTGCC 1860 
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TGGAAAGC C C TTCGGA 



<2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 02 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser Leu 
15 10 15 

Leu His Val Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu Tyr Lys Glu 
20 2 5 3 0 

Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala Asn Asp Ser 
35 40 45 

Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin lie Met Asp 
50 55 60 

Val Asp Glu Lys Asn Gin Val Leu Thr Thr Asn lie Trp Leu Gin Met 
65 70 75 80 

Ser Trp Thr Asp His Tyr Leu Gin Trp Asn Val Ser Glu Tyr Pro Gly 
85 90 95 

Val Lys Thr Val Arg Phe Pro Asp Gly Gin lie Trp Lys Pro Asp lie 
100 105 110 

Leu Leu Tyr Asn Ser Ala Asp Glu Arg Phe Asp Ala Thr Phe His Thr 
115 120 125 

Asn Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Leu Pro Pro Gly 
130 135 140 

lie Phe Lys Ser Ser Cys Tyr lie Asp Val Arg Trp Phe Pro Phe Asp 
145 150 155 160 

Val Gin His Cys Lys Leu Lys Phe Gly Ser Trp Ser Tyr Gly Gly Trp 
165 170 175 

Ser Leu Asp Leu Gin Met Gin Glu Ala Asp lie Ser Gly Tyr lie Pro 
180 185 190 

Asn Gly Glu Trp Asp Leu Val Gly lie Pro Gly Lys Arg Ser Glu Arg 
195 200 205 

Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr Phe Thr Val 
210 215 220 

Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Asn Leu Leu lie Pro 
225 230 235 240 

Cys Val Leu lie Ser Ala Leu Ala Leu Leu Val Phe Leu Leu Pro Ala 
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245 250 255 

Asp Ser Gly Glu Lys lie Ser Leu Gly lie Thr Val Leu Leu Ser Leu 
260 265 270 

Thr Val Phe Met Leu Leu Val Ala Glu lie Met Pro Ala Thr Ser Asp 
275 280 285 

Ser Val Pro Leu lie Ala Gin Tyr Phe Ala Ser Thr Met lie lie Val 
290 295 300 

Gly Leu Ser Val Val Val Thr Val lie Val Leu Gin Tyr His His His 
305 310 315 320 

Asp Pro Asp Gly Gly Lys Met Pro Lys Trp Thr Arg Val lie Leu Leu 
325 330 335 

Asn Trp Cys Ala Trp Phe Leu Arg Met Lys Arg Pro Gly Glu Asp Lys 
340 345 350 

Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser Leu Ala Sex 
355 360 365 

Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asn Gly Asn Leu 
370 375 380 

Leu Tyr lie Gly Phe Arg Gly Leu Asp Gly Val His Cys Val Pro Thr 
385 390 395 400 

Pro Asp Ser Gly Val Val Cys Gly Arg Met Ala Cys Ser Pro Thr His 
405 410 415 

Asp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly Asp Pro Asp 
420 425 430 

Leu Ala Lys lie Leu Glu Glu Val Arg Tyr lie Ala Asn Arg Phe Arg 
435 440 445 

Cys Gin Asp Glu Ser Glu Ala Val Cys Ser Glu Trp Lys Phe Ala Ala 
450 455 460 

Cys Val Val Asp Arg Leu Cys Leu Met Ala Phe Ser Val Phe Thr lie 
465 470 475 480 

lie Cys Thr lie Gly lie Leu Met Ser Ala Pro Asn Phe Val Glu Ala 
485 490 495 

Val Ser Lys Asp Phe Ala 
500 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 265.. 1773 

(D) OTHER INFORMATION: /product^ " BETA- 2 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCTCCCCC TCACCGTCCC AATTGTATTC CCTGGAAGAG CAGCCGGAAA AGCCTCCGCC 60 

TGCTCATACC AGGATAGGCA AGAAGCTGGT TTCTCCTCGC AGCCGGCTCC CTGAGGCCCA 12 0 

GGAACCACCG CGGCGGCCGG CACCACCTGG ACCCAGCTCC AGGCGGGCGC GGCTTCAGCA 180 

CCACGGACAG CGCCCCACCC GCGGCCCTCC CCCCGGCGGC GCGCTCCAGC CGGTGTAGGC 240 

GAGGCAGCGA GCTATGCCCG CGGC ATG GCC CGG CGC TGC GGC CCC GTG GCG 291 

Met Ala Arg Arg Cys Gly Pro Val Ala 
1 5 

CTG CTC CTT GGC TTC GGC CTC CTC CGG CTG TGC TCA GGG GTG TGG GGT 33 9 

Leu Leu Leu Gly Phe Gly Leu Leu Arg Leu Cys Ser Gly Val Trp Gly 
10 15 20 25 

ACG GAT ACA GAG GAG CGG CTG GTG GAG CAT CTC CTG GAT CCT TCC CGC 387 
Thr Asp Thr Glu Glu Arg Leu Val Glu His Leu Leu Asp Pro Ser Arg 
30 35 40 

TAC AAC AAG CTT ATC CGC CCA GCC ACC AAT GGC TCT GAG CTG GTG ACA 43 5 

Tyr Asn Lys Leu lie Arg Pro Ala Thr Asn Gly Ser Glu Leu Val Thr 
45 50 55 

GTA CAG CTT ATG GTG TCA CTG GCC CAG CTC ATC AGT GTG CAT GAG CGG 483 
Val Gin Leu Met Val Ser Leu Ala Gin Leu lie Ser Val His Glu Arg 
60 65 70 

GAG CAG ATC ATG ACC ACC AAT GTC TGG CTG ACC CAG GAG TGG GAA GAT 531 
Glu Gin lie Met Thr Thr Asn Val Trp Leu Thr Gin Glu Trp Glu Asp 
75 80 85 

TAT CGC CTC ACC TGG AAG CCT GAA GAG TTT GAC AAC ATG AAG AAA GTT 57 9 

Tyr Arg Leu Thr Trp Lys Pro Glu Glu Phe Asp Asn Met Lys Lys Val 
90 95 100 105 

CGG CTC CCT TCC AAA CAC ATC TGG CTC CCA GAT GTG GTC CTG TAC AAC 627 
Arg Leu Pro Ser Lys His He Trp Leu Pro Asp Val Val Leu Tyr Asn 
110 115 120 

AAT GCT GAC GGC ATG TAC GAG GTG TCC TTC TAT TCC AAT GCC GTG GTC 675 
Asn Ala Asp Gly Met Tyr Glu Val Ser Phe Tyr Ser Asn Ala Val Val 
125 130 135 

TCC TAT GAT GGC AGC ATC TTC TGG CTG CCG CCT GCC ATC TAC AAG AGC 72 3 

Ser Tyr Asp Gly Ser He Phe Trp Leu Pro Pro Ala He Tyr Lys Ser 
140 145 150 

GCA TGC AAG ATT GAA GTA AAG CAC TTC CCA TTT GAC CAG CAG AAC TGC 771 
Ala Cys Lys He Glu Val Lys His Phe Pro Phe Asp Gin Gin Asn Cys 
155 160 165 

ACC ATG AAG TTC CGT TCG TGG ACC TAC GAC CGC ACA GAG ATC GAC TTG 819 
Thr Met Lys Phe Arg Ser Trp Thr Tyr Asp Arg Thr Glu He Asp Leu 
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170 175 180 185 

GTG CTG AAG AGT GAG GTG GCC AGC CTG GAC GAC TTC AC A CCT AGT GGT 867 
Val Leu Lys Ser Glu Val Ala Ser Leu Asp Asp Phe Thr Pro Ser Gly 
190 195 200 

GAG TGG GAC ATC GTG GCG CTG CCG GGC CGG CGC AAC GAG AAC CCC GAC 915 
Glu Trp Asp lie Val Ala Leu Pro Gly Arg Arg Asn Glu Asn Pro Asp 
205 210 215 

GAC TCT ACG TAC GTG GAC ATC ACG TAT GAC TTC ATC ATT CGC CGC AAG 963 
Asp Ser Thr Tyr Val Asp lie Thr Tyr Asp Phe lie lie Arg Arg Lys 
220 225 230 

CCG CTC TTC TAC ACC ATC AAC CTC ATC ATC CCC TGT GTG CTC ATC ACC 1011 
Pro Leu Phe Tyr Thr lie Asn Leu lie lie Pro Cys Val Leu lie Thr 
235 240 245 

TCG CTA GCC ATC CTT GTC TTC TAC CTG CCA TCC GAC TGT GGC GAG AAG 1059 
Ser Leu Ala lie Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys 
250 255 2 60 2 65 

ATG ACG TTG TGC ATC TCA GTG CTG CTG GCG CTC ACG GTC TTC CTG CTG 1107 
Met Thr Leu Cys lie Ser Val Leu Leu Ala Leu Thr Val Phe Leu Leu 
270 275 280 

CTC ATC TCC AAG ATC GTG CCT CCC ACC TCC CTC GAC GTG CCG CTC GTC 1155 
Leu lie Ser Lys lie Val Pro Pro Thr Ser Leu Asp Val Pro Leu Val 
285 290 295 

GGC AAG TAC CTC ATG TTC ACC ATG GTG CTT GTC ACC TTC TCC ATC GTC 12 03 

Gly Lys Tyr Leu Met Phe Thr Met Val Leu Val Thr Phe Ser lie Val 
300 305 310 

ACC AGC GTG TGC GTG CTC AAC GTG CAC CAC CGC TCG CCC ACC ACG CAC 1251 
Thr Ser Val Cys Val Leu Asn Val His His Arg Ser Pro Thr Thr His 
315 320 325 

ACC ATG GCG CCC TGG GTG AAG GTC GTC TTC CTG GAG AAG CTG CCC GCG 12 99 

Thr Met Ala Pro Trp Val Lys Val Val Phe Leu Glu Lys Leu Pro Ala 
330 335 340 345 

CTG CTC TTC ATG CAG CAG CCA CGC CAT CAT TGC GCC CGT CAG CGC CTG 1347 
Leu Leu Phe Met Gin Gin Pro Arg His His Cys Ala Arg Gin Arg Leu 
350 355 360 

CGC CTG CGG CGA CGC CAG CGT GAG CGC GAG GGC GCT GGA GCC CTC TTC 13 95 

Arg Leu Arg Arg Arg Gin Arg Glu Arg Glu Gly Ala Gly Ala Leu Phe 
365 370 375 

TTC CGC GAA GCC CCA GGG GCC GAC TCC TGC ACG TGC TTC GTC AAC CGC 1443 
Phe Arg Glu Ala Pro Gly Ala Asp Ser Cys Thr Cys Phe Val Asn Arg 
380 385 390 

GCG TCG GTG CAG GGG TTG GCC GGG GCC TTC GGG GCT GAG CCT GCA CCA 1491 
Ala Ser Val Gin Gly Leu Ala Gly Ala Phe Gly Ala Glu Pro Ala Pro 
395 400 405 

GTG GCG GGC CCC GGG CGC TCA GGG GAG CCG TGT GGC TGT GGC CTC CGG 153 9 

Val Ala Gly Pro Gly Arg Ser Gly Glu Pro Cys Gly Cys Gly Leu Arg 
410 415 420 425 
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GAG GCG GTG GAC GGC GTG CGC TTC ATC GCA GAC CAC ATG CGG AGC GAG 1587 

Glu Ala Val Asp Gly Val Arg Phe lie Ala Asp His Met Arg Ser Glu 

430 435 440 

GAC GAT GAC CAG AGC GTG AGT GAG GAC TGG AAG TAC GTC GCC ATG GTG 163 5 

Asp Asp Asp Gin Ser Val Ser Glu Asp Trp Lys Tyr Val Ala Met Val 

445 450 455 

ATC GAC CGC CTC TTC CTC TGG ATC TTT GTC TTT GTC TGT GTC TTT GGC 16 83 

lie Asp Arg Leu Phe Leu Trp lie Phe Val Phe Val Cys Val Phe Gly 
460 465 470 

ACC ATC GGC ATG TTC CTG CAG CCT CTC TTC CAG AAC TAC ACC ACC ACC 1731 

Thr lie Gly Met Phe Leu Gin Pro Leu Phe Gin Asn Tyr Thr Thr Thr 



475 




480 




485 






ACC TTC CTC CAC TCA GAC CAC TCA GCC CCC AGC 
Thr Phe Leu His Ser Asp His Ser Ala Pro Ser 
490 495 500 


TCC AAG TGAGGCCCTT 
Ser Lys 


1780 


CCTCATCTCC 


ATGCTCTTTC 


ACCCTGCCAC 


CCiC xGCi'GC 


ACAGTAGTGT 


TGGGTGGAGG 


1840 


ATGGACGAGT 


GAGC TAC CAG 


GAAGAGGGGC 


GCTGCCCCCA 


CAGATCCATC 


CTTTTGCTTC 


1900 


ATC TGGAGTC 


CCTCCTCCCC 


CACGCCTCCA 


TC CAC AC AC A 


GCAGCTCCAA 


CCTGGAGGCT 


1960 


GGACCAACTG 


CTTTGTTTTG 


GCTGCTCTCC 


ATCTCTTGTA 


CCAGCCCAGG 


CAATAGTGTT 


2020 


GAGGAGGGGA 


GCAAGGCTGC 


TAAGTGGAAG 


ACAGAGATGG 


CAGAGCCATC 


CACCCTGAGG 


2080 


AGTGACGGGC 


AAGGGGC CAG 


GAAGGGGACA 


GGATTGTCTG 


CTGCCTCCAA 


GTCATGGGAG 


2140 


AAGAGGGGTA 


TAGGACAAGG 


GGTGGAAGGG 


CAGGAGCTCA 


CACCGCACCG 


GGCTGGCCTG 


2200 


ACACAATGGT 


AGCTCTGAAG 


GGAGGGGAAG 


AGAGAGGCCT 


GGGTGTGACC 


TGACACCTGC 


2260 


CGCTGCTTGA 


GTGGACAGCA 


GCTGGACTGG 


GTGGGCCCCA 


CAGTGGTCAG 


CGATTCCTGC 


2320 


CAAGTAGGGT 


TTAGCCGGGC 


CCCATGGTCA 


CAGACCCCTG 


GGGGAGGCTT 


CCAGCTCAGT 


2380 


CCCACAGCCC 


CTTGCTTC T A 


AGGGATCCAG 


AGACCTGCTC 


CAGATCCTCT 


TTCCCCACTG 


2440 


AAGAATTC 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ala Arg Arg Cys Gly Pro Val Ala Leu Leu Leu Gly Phe Gly Leu 
15 10 15 

Leu Arg Leu Cys Ser Gly Val Trp Gly Thr Asp Thr Glu Glu Arg Leu 
20 25 30 
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Val Glu His Leu 
35 

Ala Thr Asn Gly 
50 

Ala Gin Leu lie 
65 

Val Trp Leu Thr 



Glu Glu Phe Asp 
100 



Trp Leu Pro Asp 
115 

Val Ser Phe Tyr 
13 0 

Trp Leu Pro Pro 
145 

His Phe Pro Phe 



Thr Tyr Asp Arg 
180 



Ser Leu Asp Asp 
195 

Pro Gly Arg Arg 
210 

Thr Tyr Asp Phe 
225 

Leu lie lie Pro 



Tyr Leu Pro Ser 
260 

Leu Leu Ala Leu 
275 

Pro Thr Ser Leu 
290 

Met Val Leu Val 
305 

Val His His Arg 



Val Val Phe Leu 
340 

Arg His His Cys 



Leu Asp Pro Ser 
40 

Ser Glu Leu Val 
55 

Ser Val His Glu 
70 

Gin Glu Trp Glu 
85 



Asn Met Lys Lys 



Val Val Leu Tyr 
120 

Ser Asn Ala Val 
13 5 

Ala lie Tyr Lys 
150 

Asp Gin Gin Asn 
165 

Thr Glu lie Asp 



Phe Thr Pro Ser 
200 



Asn Glu Asn Pro 
215 

lie lie Arg Arg 
230 

Cys Val Leu lie 
245 

Asp Cys Gly Glu 



Thr Val Phe Leu 
280 

Asp Val Pro Leu 
295 

Thr Phe Ser lie 
310 

Ser Pro Thr Thr 
325 

Glu Lys Leu Pro 



Ala Arg Gin Arg 



Arg Tyr Asn Lys 



Thr Val Gin Leu 
60 

Arg Glu Gin lie 
75 



Asp Tyr Arg Leu 
90 

Val Arg Leu Pro 
105 

Asn Asn Ala Asp 



Val Ser Tyr Asp 
140 



Ser Ala Cys Lys 
155 

Cys Thr Met Lys 
170 

Leu Val Leu Lys 
185 

Gly Glu Trp Asp 



Asp Asp Ser Thr 
220 

Lys Pro Leu Phe 
235 

Thr Ser Leu Ala 
250 

Lys Met Thr Leu 
265 

Leu Leu lie Ser 



Val Gly Lys Tyr 
300 

Val Thr Ser Val 
315 

His Thr Met Ala 
330 

Ala Leu Leu Phe 
345 



Leu Arg Leu Arg 
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Leu lie Arg Pro 
45 

Met Val Ser Leu 



Met Thr Thr Asn 
80 



Thr Trp Lys Pro 
95 

Ser Lys His lie 
110 

Gly Met Tyr Glu 
125 

Gly Ser lie Phe 



lie Glu Val Lys 
160 



Phe Arg Ser Trp 
175 

Ser Glu Val Ala 
190 

lie Val Ala Leu 
205 

Tyr Val Asp lie 



Tyr Thr lie Asn 
240 

lie Leu Val Phe 
255 

Cys lie Ser Val 
270 

Lys lie Val Pro 
285 

Leu Met Phe Thr 



Cys Val Leu Asn 
320 

Pro Trp Val Lys 
335 

Met Gin Gin Pro 
350 

Arg Arg Gin Arg 
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355 360 365 

Glu Arg Glu Gly Ala Gly Ala Leu Phe Phe Arg Glu Ala Pro Gly Ala 
370 375 380 

Asp Ser Cys Thr Cys Phe Val Asn Arg Ala Ser Val Gin Gly Leu Ala 
385 390 395 400 

Gly Ala Phe Gly Ala Glu Pro Ala Pro Val Ala Gly Pro Gly Arg Ser 
405 410 415 

Gly Glu Pro Cys Gly Cys Gly Leu Arg Glu Ala Val Asp Gly Val Arg 
420 425 430 

Phe lie Ala Asp His Met Arg Ser Glu Asp Asp Asp Gin Ser Val Ser 
435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val lie Asp Arg Leu Phe Leu Trp 
450 455 460 

lie Phe Val Phe Val Cys val Phe Gly Thr lie Gly Met Phe Leu Gin 
465 470 475 480 

Pro Leu Phe Gin Asn Tyr Thr Thr Thr Thr Phe Leu His Ser Asp His 
485 490 495 

Ser Ala Pro Ser Ser Lys 
500 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 98 1474 

(D) OTHER INFORMATION: /product= " BETA- 3 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCGGAACCCC TGTATTTTCT TTTCAAAACC CCCTTTTCCA GTGGAAATGC TCTGTTGTTA 60 

AAAAGGAAGA AACTGTCTTT C TGAAAC TG A CATCACG ATG CTC CCA GAT TTT ATG 115 

Met Leu Pro Asp Phe Met 
1 5 

CTG GTT CTC ATC GTC CTT GGC ATC CCT TCC TCA GCC ACC ACA GGT TTC 163 
Leu Val Leu lie Val Leu Gly He Pro Ser Ser Ala Thr Thr Gly Phe 
10 15 20 

AAC TCA ATC GCC GAA AAT GAA GAT GCC CTC CTC AGA CAT TTG TTC CAA 211 
Asn Ser He Ala Glu Asn Glu Asp Ala Leu Leu Arg His Leu Phe Gin 
25 30 35 
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GGT TAT CAG AAA TGG GTC CGC CCT GTA TTA CAT TCT AAT GAC ACC ATA 259 
Gly Tyr Gin Lys Trp Val Arg Pro Val Leu His Ser Asn Asp Thr He 
40 45 50. 

AAA GTA TAT TTT GGA TTG AAA ATA TCC CAG CTT GTA GAT GTG GAT GAA 3 07 

Lys Val Tyr Phe Gly Leu Lys He Ser Gin Leu Val Asp Val Asp Glu 
55 60 65 70 

AAG AAT CAG CTG ATG ACA ACC AAT GTG TGG CTC AAA CAG GAA TGG ACA 355 
Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu Trp Thr 
75 80 85 

GAC CAC AAG TTA CGC TGG AAT CCT GAT GAT TAT GGT GGG ATC CAT TCC 403 
Asp His Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly He His Ser 
90 95 100 

ATT AAA GTT CCA TCA GAA TCT CTG TGG CTT CCT GAC ATA GTT CTC TTT 451 
He Lys Val Pro Ser Glu Ser Leu Trp Leu Pro Asp He Val Leu Phe 
105 110 115 

GAA AAT GCT GAC GGC CGC TTC GAA GGC TCC CTG ATG ACC AAG GTC ATC 499 
Glu Asn Ala Asp Gly Arg Phe Glu Gly Ser Leu Met Thr Lys Val He 
120 125 130 

GTG AAA TCA AAC GGA ACT GTT GTC TGG ACC CCT CCC GCC AGC TAC AAA 547 
Val Lys Ser Asn Gly Thr Val Val Trp Thr Pro Pro Ala Ser Tyr Lys 
135 140 145 150 

AGC TCC TGC ACC ATG GAC GTC ACG TTT TTC CCG TTC GAC CGA CAG AAC 595 
Ser Ser Cys Thr Met Asp Val Thr Phe Phe Pro Phe Asp Arg Gin Asn 
155 160 165 

TGC TCC ATG AAG TTT GGA TCC TGG ACT TAT GAT GGC ACC ATG GTT GAC 643 
Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Thr Met Val Asp 
170 175 180 

CTC ATT TTG ATC AAT GAA AAT GTC GAC AGA AAA GAC TTC TTC GAT AAC 691 
Leu He Leu He Asn Glu Asn Val Asp Arg Lys Asp Phe Phe Asp Asn 
185 190 195 

GGA GAA TGG GAA ATA CTG AAT GCA AAG GGG ATG AAG GGG AAC AGA AGG 73 9 

Gly Glu Trp Glu He Leu Asn Ala Lys Gly Met Lys Gly Asn Arg Arg 
200 205 210 

GAC GGC GTG TAC TCC TAT CCC TTT ATC ACG TAT TCC TTC GTC CTG AGA 7 87 

Asp Gly Val Tyr Ser Tyr Pro Phe He Thr Tyr Ser Phe Val Leu Arg 
215 220 225 230 

CGC CTG CCT TTA TTC TAT ACC CTC TTT CTC ATC ATC CCC TGC CTG GGG 83 5 

Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys Leu Gly 
235 240 245 

CTG TCT TTC CTA ACA GTT CTT GTG TTC TAT TTA CCT TCG GAT GAA GGA 883 
Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Glu Gly 
250 255 260 

GAA AAA CTT TCA TTA TCC ACA TCG GTC TTG GTT TCT CTG ACA GTT TTC 931 
Glu Lys Leu Ser Leu Ser Thr Ser Val Leu Val Ser Leu Thr Val Phe 
265 270 275 

CTT TTA GTG ATT GAA GAA ATC ATC CCA TCG TCT TCC AAA GTC ATT CCT 979 
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Leu Leu Val lie Glu Glu lie lie Pro Ser Ser Ser Lys Val lie Pro 
280 285 290 

CTC ATT GGA GAG TAC CTG CTG TTC ATC ATG ATT TTT GTG ACC CTG TCC 1027 
Leu lie Gly Glu Tyr Leu Leu Phe lie Met lie Phe Val Thr Leu Ser 
295 300 305 310 

ATC ATT GTT ACC GTG TTT GTC ATT AAC GTT CAC CAC AGA TCT TCT TCC 1075 
lie lie Val Thr Val Phe Val lie Asn Val His His Arg Ser Ser Ser 
315 320 325 

ACG TAC CAC CCC ATG GCC CCC TGG GTT AAG AGG CTC TTT CTG CAG AAA 1123 
Thr Tyr His Pro Met Ala Pro Trp Val Lys Arg Leu Phe Leu Gin Lys 
330 335 340 

CTT CCA AAA TTA CTT TGC ATG AAA GAT CAT GTG GAT CGC TAC TCA TCC 1171 
Leu Pro Lys Leu Leu Cys Met Lys Asp His Val Asp Arg Tyr Ser Ser 
345 350 355 

CCA GAG AAA GAG GAG AGT CAA CCA GTA GTG AAA GGC AAA GTC CTC GAA 1219 
Pro Glu Lys Glu Glu Ser Gin Pro Val Val Lys Gly Lys Val Leu Glu 
360 365 370 

AAA AAG AAA CAG AAA CAG CTT AGT GAT GGA GAA AAA GTT CTA GTT GCT 12 67 

Lys Lys Lys Gin Lys Gin Leu Ser Asp Gly Glu Lys Val Leu Val Ala 
375 380 385 390 

TTT TTG GAA AAA GCT GCT GAT TCC ATT AGA TAC ATT TCC AGA CAT GTG 1315 
Phe Leu Glu Lys Ala Ala Asp Ser lie Arg Tyr lie Ser Arg His Val 
395 400 405 

AAG AAA GAA CAT TTT ATC AGC CAG GTA GTA CAA GAC TGG AAA TTT GTA 13 63 

Lys Lys Glu His Phe lie Ser Gin Val Val Gin Asp Trp Lys Phe Val 
410 415 420 

GCT CAA GTT CTT GAC CGA ATC TTC CTG TGG CTC TTT CTG ATA GTG TCA 1411 
Ala Gin Val Leu Asp Arg lie Phe Leu Trp Leu Phe Leu lie Val Ser 
425 430 435 

GCA AC A GGC TCG GTT CTG ATT TTT ACC CCT GCT TTG AAG ATG TGG CTA 1459 
Ala Thr Gly Ser Val Leu lie Phe Thr Pro Ala Leu Lys Met Trp Leu 
440 445 450 



CAT AGT TAC 


! CAT TAGGAATTTC AAAAGACATA AGTAC TAAAT TACACCTTAG 


1511 


His Ser Tyr His 












455 














ACCTGACATC 


TGGC TATC AC 


ACAGACAGAA 


TCCAAATGCA 


TGTGCTTGTT 


CTACGAACCC 


1571 


CGAATGCGTT 


GTCTTTGTGG 


AAATGGAACA 


TC TC C TC ATG 


GGAGAAACTC 


TGGTAAATGT 


1631 


GCTCATTTGT 


GGTTGCCATG 


AGAGTGAGCT 


GCTTTTAAAG 


AAAGTGGAGC 


CTCCTCAGAC 


1691 


CCCTGCCTTG 


GCTTTCCCAG 


ACATTCAGGG 


AGGGATCATA 


GGTCCAGGCT 


TGAGC TC AC A 


1751 


TGTGGC C AGA 


GTGCACAAAA 


AGC TGTTGC T 


AC TTGGTGG A 


GGAACACCTC 


CTAGAAGCAG 


1811 


CAGGCCTCGG 


TGGTGGGGGA 


GGGGGGATTC 


AC C TGG AATT 


AAGGAAGTCT 


C GGTGTC GAG 


1871 


CTATCTGTGT 


GGGCAGAGCC 


TGGATCTCCC 


ACCCTGCACT 


GGCCTCCTTG 


GTGCCG 


1927 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Leu Pro Asp Phe Met Leu Val Leu lie Val Leu Gly lie Pro Ser 
15 10 15 

Ser Ala Thr Thr Gly Phe Asn Ser lie Ala Glu Asn Glu Asp Ala Leu 
20 25 30 

Leu Arg His Leu Phe Gin Gly Tyr Gin Lys Trp Val Arg Pro Val Leu 
35 40 45 

His Ser Asn Asp Thr lie Lys Val Tyr Phe Gly Leu Lys lie Ser Gin 
50 55 60 

Leu Val Asp Val Asp Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lys Gin Glu Trp Thr Asp His Lys Leu Arg Trp Asn Pro Asp Asp 
85 90 95 

Tyr Gly Gly lie His Ser lie Lys Val Pro Ser Glu Ser Leu Trp Leu 
100 105 110 

Pro Asp lie Val Leu Phe Glu Asn Ala Asp Gly Arg Phe Glu Gly Ser 
115 120 125 

Leu Met Thr Lys Val lie Val Lys Ser Asn Gly Thr Val Val Trp Thr 
130 135 140 

Pro Pro Ala Ser Tyr Lys Ser Ser Cys Thr Met Asp Val Thr Phe Phe 
145 150 155 160 

Pro Phe Asp Arg Gin Asn Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr 
165 170 175 

Asp Gly Thr Met Val Asp Leu lie Leu lie Asn Glu Asn Val Asp Arg 
180 185 190 

Lys Asp Phe Phe Asp Asn Gly Glu Trp Glu lie Leu Asn Ala Lys Gly 
195 200 205 

Met Lys Gly Asn Arg Arg Asp Gly Val Tyr Ser Tyr Pro Phe lie Thr 
210 215 220 

Tyr Ser Phe Val Leu Arg Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu 
225 230 235 240 

lie lie Pro Cys Leu Gly Leu Ser Phe Leu Thr Val Leu Val Phe Tyr 
245 250 255 

Leu Pro Ser Asp Glu Gly Glu Lys Leu Ser Leu Ser Thr Ser Val Leu 
260 265 270 
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Val Ser Leu Thr Val Phe Leu Leu Val lie Glu Glu lie lie Pro Ser 
275 280 285 

Ser Ser Lys Val lie Pro Leu lie Gly Glu Tyr Leu Leu Phe lie Met 
290 295 300 

lie Phe Val Thr Leu Ser lie lie Val Thr Val Phe Val lie Asn Val 
305 310 315 320 

His His Arg Ser Ser Ser Thr Tyr His Pro Met Ala Pro Trp Val Lys 
325 330 335 

Arg Leu Phe Leu Gin Lys Leu Pro Lys Leu Leu Cys Met Lys Asp His 
340 345 350 

Val Asp Arg Tyr Ser Ser Pro Glu Lys Glu Glu Ser Gin Pro Val Val 
355 360 365 

Lys Gly Lys Val Leu Glu Lys Lys Lys Gin Lys Gin Leu Ser Asp Gly 
370 3*75 3 80 

Glu Lys Val Leu Val Ala Phe Leu Glu Lys Ala Ala Asp Ser lie Arg 
385 390 395 400 

Tyr lie Ser Arg His Val Lys Lys Glu His Phe lie Ser Gin Val Val 
405 410 415 

Gin Asp Trp Lys Phe Val Ala Gin Val Leu Asp Arg lie Phe Leu Trp 
420 425 430 

Leu Phe Leu lie Val Ser Ala Thr Gly Ser Val Leu lie Phe Thr Pro 
435 440 445 

Ala Leu Lys Met Trp Leu His Ser Tyr His 
450 455 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 87 .. 1583 

(D) OTHER INFORMATION: /product= " BETA- 4 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCGGCGCTCA CTCGACCGCG CGGCTCACGG GTGCCCTGTG ACCCCACAGC GGAGCTCGCG 60 

GCGGCTGCCA CCCGGCCCCG CCGGCCATGA GGCGCGCGCC TTCCCTGGTC CTTTTCTTCC 12 0 

TGGTCGCCCT TTGCGGGCGC GGGAACTGCC GCGTGGCCAA TGCGGAGGAA AAGCTGATGG 180 
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ACGACCTTCT 


GAACAAAACC 


CGTTACAATA 


AGCTCATCTC 


CATCAAGCTG 


CAGCTCTCCC 


AGCAGATCAT 


GACCACCAAT 


GTCTGGCTGA 


GGAACAGCTC 


CCGCTACGAG 


GGTGTGAACA 


TGCCTGACAT 


CGTGCTTTAC 


AACAACGCCG 


AC TTGAT AGT 


CCGGTCCAAC 


GGCAGCGTCC 


CCTGCAAGAT 


TGAGGTGAAG 


TACTTTCCCT 


GCTCCTGGAC 


CTATGACCAC 


AC GG AG AT AG 


TGGATGACTT 


TACTCCCAGT 


GGTGAGTGGG 


TGAACCCACA 


AGACCCCAGC 


T AC GTGG AC G 


CTC TGTTCTA 


CACCATCAAC 


CTCATCATCC 


TCGTCTTCTA 


CCTGCCATCC 


GACTGCGGCG 


TGGCACTGAC 


ATTCTTCCTG 


CTGCTCATCT 


TGCCTCTCAT 


CGGCAAGTAC 


CTCATGTTCA 


CCAGCGTCTG 


TGTGCTCAAT 


GTGCACCACC 


GGGTCAAGCG 


CTGCTTCCTG 


CACAAGCTGC 


CCGACAGCAG 


CCCGGCCAGA 


GCCTTCCCGC 


CCACCGCCAC 


CTCCACCAGC 


CCCTCCAACT 


CCGCCTCTGC 


AGCTTCCAAG 


TCTCCAGCCG 


TCTGGCTGCG 


GTCCTCTGGG 


AGGTTCCGAC 


GCTTCATCGC 


CCAGCACATG 


AAGAATGACG 


AGTACGTGGC 


TATGGTGGTG 


GACCGGCTGT 


TGGGCACTGT 


GGGGCTCTTC 


CTGCCGCCCC 


CCTACGCTGC 


CCAGCGTGAC 


TGAGGGCCCC 


CCGGGTGGGC 


AC TTTGC TGC 


TTCTTTCTGG 


GTGAGCATTG 


GCCATCAACC 


CCATCAAACC 


GGCCTGGCCT 


GTCCTCTCTG 


AATGCCTTGG 


GCTTCAGACA 


GTTCAATTCT 


GGCCTGTCTT 


ATGACTTCGT 


AGCAGCACCT 


ACTATGCTTC 



(2) INFORMATION FOR SEQ ID NO: 18 



ACCTGATCCG 


CCCAGCCACC 


AGCTCCTCAC 


240 


TGGCCCAGCT 


TATCAGCGTG 


AATGAGCGAG 


300 


AACAGGAATG 


GAC TGATTAC 


CGCCTGACCT 


360 


TCCTGAGGAT 


CCCTGCAAAG 


CGCATCTGGT 


420 


AC GGG AC C T A 


TGAGGTGTCT 


GTCTACACCA 


480 


TGTGGCTGCC 


CCCTGCCATC 


TACAAGAGCG 


540 


TCGACCAGCA 


GAACTGCACC 


CTCAAGTTCC 


600 


AC ATGGTC C T 


CATGACGCCC 


ACAGCCAGCA 


660 


ACATAGTGGC 


CCTCCCAGGG 


AGAAGGACAG 


720 


TG AC TT AC G A 


CTTCATCATC 


AAGCGCAAGC 


780 


CCTGCGTGCT 


CACCACCTTG 


CTGGCCATCC 


840 


AGAAGATGAC 


ACTGTGCATC 


TCAGTGCTGC 


900 


CCAAGATCGT 


GCCACCCACC 


TCCCTCGATG 


960 


CCATGGTGCT 


GGTCACCTTC 


TCCATCGTCA 


1020 


GCTCGCCCAG 


CACCCACACC 


ATGGCACCCT 


1080 


CTACCTTCCT 


CTTCATGAAG 


CGCCCTGGCC 


1140 


CCAGCAAGTC 


ATGCGTGACC 


AAGCCCGAGG 


1200 


TCTATGGGAA 


CTCCATGTAC 


TTTGTGAACC 


1260 


GCTCTACCCC 


GGTGGCTATC 


CCCAGGGATT 


1320 


AGGATGTGC A 


GGAGGCATTA 


GAAGGTGTCA 


1380 


ATGAAGACCA 


GAGTGTCGTT 


GAGGACTGGA 


1440 


TCCTGTGGGT 


GTTCATGTTT 


GTGTGCGTCC 


1500 


TCTTCCAGAC 


CCATGCAGCT 


TCTGAGGGGC 


1560 


CTGGGTTGTG 


GGGTGAGAGG 


ATGTGAGTGG 


1620 


GTTGTGGCCG 


ATGAGGCCCT 


AAGTAAATAT 


1680 


AGCCACAGCC 


GTGGAACAGG 


CAAGGATGGG 


1740 


AGGGATCCCA 


GGAAGCCCCA 


GTAGGAGGGA 


1800 


CCTTCCCTGC 


ACCGGGCAAT 


GGGGATAAAG 


1860 


AGGC ATGGTG 


CCGGCCTGCC 


TCTCC 


1915 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Arg Arg Ala Pro Ser Leu Val Leu Phe Phe Leu Val Ala Leu Cys 
15 10 15 

Gly Arg Gly Asn Cys Arg Val Ala Asn Ala Glu Glu Lys Leu Met Asp 
20 25 30 

Asp Leu Leu Asn Lys Thr Arg Tyr Asn Asn Leu lie Arg Pro Ala Thr 
35 40 45 

Ser Ser Ser Gin Leu lie Ser lie Lys Leu Gin Leu Ser Leu Ala Gin 
50 55 60 

Leu lie Ser Val Asn Glu Arg Glu Gin lie Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lys Gin Glu Trp Thr Asp Tyr Arg Leu Thr Trp Asn Ser Ser Arg 
85 90 95 

Tyr Glu Gly Val Asn lie Leu Arg lie Pro Ala Lys Arg lie Trp Leu 
100 105 110 

Pro Asp lie Val Leu Tyr Asn Asn Ala Asp Gly Thr Tyr Glu Val Ser 
115 120 125 

Val Tyr Thr Asn Leu lie Val Arg Ser Asn Gly Ser Val Leu Trp Leu 
130 135 140 

Pro Pro Ala lie Tyr Lys Ser Ala Cys Lys lie Glu Val Lys Tyr Phe 
145 150 155 160 

Pro Phe Asp Gin Gin Asn Cys Thr Leu Lys Phe Arg Ser Trp Thr Tyr 
165 170 175 

Asp His Thr Glu lie Asp Met Val Leu Met Thr Pro Thr Ala Ser Met 
180 185 190 

Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp lie Val Ala Leu Pro Gly 
195 200 205 

Arg Arg Thr Val Asn Pro Gin Asp Pro Ser Tyr Val Asp Val Thr Tyr 
210 215 220 

Asp Phe lie lie Lys Arg Lys Pro Leu Phe Tyr Thr lie Asn Leu lie 
225 230 235 240 

lie Pro Cys Val Leu Thr Thr Leu Leu Ala lie Leu Val Phe Tyr Leu 
245 250 255 

Pro Ser Asp Cys Gly Glu Lys Met Thr Leu Cys lie Ser Val Leu Leu 
260 265 270 

Ala Leu Thr Phe Phe Leu Leu Leu lie Ser Lys lie Val Pro Pro Thr 
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275 280 285 

Ser Leu Asp Val Pro Leu lie Gly Lys Tyr Leu Met Phe Thr Met Val 
290 295 300 

Leu Val Thr Phe Ser lie Val Thr Ser Val Cys Val Leu Asn Val His 
305 310 315 320 

His Arg Ser Pro Ser Thr His Thr Met Ala Pro Trp Val Lys Arg Cys 
325 330 335 

Phe Leu His Lys Leu Pro Thr Phe Leu Phe Met Lys Arg Pro Gly Pro 
340 345 350 

Asp Ser Ser Pro Ala Arg Ala Phe Pro Pro Ser Lys Ser Cys Val Thr 
355 360 365 

Lys Pro Glu Ala Thr Ala Thr Ser Thr Ser Pro Ser Asn Phe Tyr Gly 
370 375 380 



Asn oer iiec Tyr Phe Val Asn Pro Ala Ser Ala Ala Ser Lys Ser Pro 
385 390 395 400 

Ala Gly Ser Thr Pro Val Ala lie Pro Arg Asp Phe Trp Leu Arg Ser 
405 410 415 

Ser Gly Arg Phe Arg Gin Asp Val Gin Glu Ala Leu Glu Gly Val Ser 
420 425 430 

Phe lie Ala Gin His Met Lys Asn Asp Asp Glu Asp Gin Ser Val Val 
435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val Val Asp Arg Leu Phe Leu Trp 
450 455 460 

Val Phe Met Phe Val Cys Val Leu Gly Thr Val Gly Leu Phe Leu Pro 
465 470 475 480 

Pro Leu Phe Gin Thr His Ala Ala Ser Glu Gly Pro Tyr Ala Ala Gin 
485 490 495 

Arg Asp 
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